The ability to train large-scale neural networks has resulted in state-of-the-art performance in many areas of computer vision. These results have largely come from computational breakthroughs of two forms: model parallelism, e.g. GPU-accelerated training, which has seen quick adoption in computer vision circles, and data parallelism, e.g. A-SGD, whose large scale has been used mostly in industry. We report early experiments with a system that makes use of both model parallelism and data parallelism, which we call GPU A-SGD. We show that using GPU A-SGD it is possible to speed up training of large convolutional neural networks useful for computer vision. We believe GPU A-SGD will make it possible to train larger networks on larger training sets in a reasonable amount of time.
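The data-parallel side of this approach, asynchronous SGD, can be illustrated with a minimal sketch: several workers draw examples from their own data shards and push gradient updates to a shared parameter copy without synchronization barriers. This is an illustrative toy (a one-parameter least-squares problem with thread-based workers), not the paper's implementation; all names (`params`, `worker`, the learning rate) are assumptions for the example.

```python
import threading
import random

# Toy objective: fit w so that y = w * x, for data generated with w = 3.
data = [(x, 3.0 * x) for x in [0.5, 1.0, 1.5, 2.0]]

params = {"w": 0.0}   # shared parameters; in GPU A-SGD each worker would
                      # hold a model replica on its own GPU (model parallelism)
lr = 0.01             # illustrative learning rate

def worker(shard, steps, seed):
    # Each worker samples from its shard and applies gradient updates to
    # the shared parameters asynchronously (no locks, no barriers).
    rng = random.Random(seed)
    for _ in range(steps):
        x, y = rng.choice(shard)
        grad = 2.0 * (params["w"] * x - y) * x   # d/dw of (w*x - y)^2
        params["w"] -= lr * grad                  # asynchronous update

threads = [threading.Thread(target=worker, args=(data, 500, i))
           for i in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

# params["w"] converges toward 3.0 despite workers racing on the update.
```

The point of the sketch is that stale, racing updates still converge on this convex toy problem; the paper's contribution is running each such worker's gradient computation on a GPU so both forms of parallelism compound.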